Pitch analysis methods for cross-speaker comparison
Authors
Abstract
A system of fundamental frequency analysis and normalisation is described for obtaining pitch data and comparing them across speakers. The system was used to analyse English and Spanish speakers' productions in order to compare the realization of accentual focus in the two languages. It is based on the simultaneous recording of speech and the laryngeal signal, the latter monitored by means of an electrolaryngograph. The analysis is done in three stages: (a) auditory analysis; (b) PCLX analysis, which yields measures in Hz for peaks, troughs and other relevant points in the contour, together with fundamental frequency statistics from the complete data set for each speaker; (c) a detailed analysis using SFS, with simultaneous display of the speech pressure waveform, Lx waveform, excitation period measurements and fundamental frequency trace, and with playback facilities for speech and Fo. Stage (c) is used for segmentation, to filter out nodes which are due to micro-intonation, and to pinpoint problem areas in the Fo trace. Outlying and anomalous period measurements may be replaced by a five-point median value. The resulting contours are normalised by converting Hz measures to percentage values (positive or negative) of the speaker's mean Fo, which is obtained from stage (b).
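The last two steps of the abstract (five-point median replacement and normalisation to a percentage of the speaker's mean Fo) can be illustrated with a minimal Python sketch. It is not the authors' PCLX or SFS implementation. The function names, the `tolerance` threshold used to decide what counts as an outlier, the choice to leave the two values at each edge untouched, and the application to Fo values rather than raw period measurements are all assumptions made for illustration. The percentage is read here as the signed deviation from the speaker's mean, which is one plausible interpretation of "positive or negative" values.

```python
from statistics import median


def median_replace_outliers(f0_hz, tolerance=0.2):
    """Replace outlying values with the median of a five-point window.

    A value counts as an outlier when it differs from the median of the
    window centred on it by more than `tolerance` (a fraction of that
    median). The threshold is a hypothetical choice, not from the source.
    The first and last two values have no full window and are kept as-is.
    """
    cleaned = list(f0_hz)
    for i in range(2, len(f0_hz) - 2):
        # Window is taken from the original values, not the cleaned ones.
        window_median = median(f0_hz[i - 2:i + 3])
        if abs(f0_hz[i] - window_median) > tolerance * window_median:
            cleaned[i] = window_median
    return cleaned


def normalise_to_percent(f0_hz, speaker_mean_hz):
    """Express each Hz value as a signed percentage deviation from the mean."""
    return [100.0 * (f - speaker_mean_hz) / speaker_mean_hz for f in f0_hz]


if __name__ == "__main__":
    contour = [200, 202, 400, 204, 206, 208]  # 400 Hz is an octave error
    cleaned = median_replace_outliers(contour)
    # The speaker mean would come from the complete data set (stage b);
    # 200 Hz is an invented value for this example.
    print(normalise_to_percent(cleaned, speaker_mean_hz=200))
```

Because each contour is expressed relative to its own speaker's mean, a peak 25% above the mean is comparable between a speaker with a mean of 120 Hz and one with a mean of 220 Hz, which is the point of the normalisation step.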
Similar resources
Text-independent Speaker Identification System Using Average Pitch and Formant Analysis
The aim of this paper is to design a closed-set text-independent Speaker Identification system using average pitch and speech features from formant analysis. The speech features represented by the speech signal are potentially characterized by formant analysis (Power Spectral Density). In this paper we have designed two methods: one for average pitch estimation based on Autocorrelation and othe...
Pitch synchronized speech processing (PSSP) for speaker recognition
A method for speech signal enhancement is developed with application to automatic speaker recognition where the signals have different channel conditions. The basis of this technique is a robust pitch detection algorithm that accurately estimates the instantaneous pitch rate, and extracts single pitch period speech segments. This technique of pitch synchronized speech processing (PSSP) provides...
Using Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from a few spoken words or sentences has many applications. Many methods, such as DTW, HMM, VQ and MQ, can be used for speaker verification. We applied MQ for its precise, reliable and robust performance with computational simplicity. We also used pitch frequency and log gain contour to further improve system performance.
Evaluation of Joint Position-Pitch Estimation Algorithm for Localising Multiple Speakers in Adverse Acoustical Environments
Automatic speaker localisation, detection and tracking are important challenges in multi-channel hands-free communication systems. In particular, simultaneous localisation of different speakers is of great interest for multi-microphone noise reduction schemes. Besides position, another possible feature to distinguish between different speakers is the fundamental frequency (pitch) of the speaker...